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DETECTION METHOD FOR C-RAF-1 GENES 



vial A of the Invention 

1 0 The present invention relates to ( 1 ) a method of 

identifying an individual at an increased risk for 
developing cancer, (2) a method for determining a 
prognosis of patients afflicted with cancer, and (3) a 
method for determining the proper course of treatment for 

15 a patient afflicted with cancer. 

Background Information 

Lung cancer claims more lives in the United 

20 States than any other neoplasm (R.S. Finley, Am. Pharm. 
NS29, 39 (1989)), and of the various forms lung 
adenocarcinomas have one of the worst prognoses (T.P. 
Miller, Sftmin. Oncol. 17, 11 (1990)). The incidence of 
adenocarcinoma of the lung (ACL) in the United States is 

25 also guickly rising (I. Linnoila, Hematol. Oncol. North. 
Am. 4. 1027 (1990); J.B. Sorensen, H.H. Hansen, Cancer 
Surviv. 8, 671 (1989)). In order to gain insight into 
this complex and deadly disease, a model system for its 
study has been developed. For such a model to provide 

30 clinically relevant data several criteria must be met. 

The tumors produced should be histologically equivalent to 
their human counterparts, tumor induction must be reliable 
and reproducible, and the numbers generated must be great 
enough to provide statistical significance. To satisfy 

35 these conditions a system has been created which uses two 
inbred mouse strains (NFS/n and AKR) along with 
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transplacental exposure to the potent carcinogen N-ethyl- 
N-nitrosourea (BHD) and promotion with the antioxidant 
butylated hydroxytoluene (BHT) • The resulting tumors were 
examined for altered expression or structural mutations of 
5 genes implicated in lung tumor development such as rajL«_ 
myc, and raf oncogenes (CD. Little et al., Nature. 306, 
194 (1983); P.E- Kiefer et al., Cancer Res. , 47, 6236 
(1987); E. Santos et al., Science 223, 661 (1984); S. 
Rodenhuis, B EngJ - .t. Med. 317, 929 (1987); M. Barbacid, 
10 jig J run. Tnvest. 20, 225 (1990); D.R. Rapp et al., J, 
Tn<- A fiSOC . f ™- "»» Shudv of Tffliq Cancer 4, 162 (1988); 
M.J. Birrer et al., Ann. Rev. Med. 40, 305 (1989); G. 
Sithanandam et al., Oncogene 4, 451 (1989)). 

15 rst proto-oncogenes are evolutionary highly 

conserved genes encoding cytoplasmic serine/ threonine 
specific kinases, which function in mitogen signal 
transduction (reviewed in D.R. Rapp et al., The Oncogene 
Handbook, T. Curran et al., Eds. (Elsevier Science 
Publishers, The Netherlands, 1988), PP- 115-154; D.R. 
Rapp, Qncsaeas 6, 495 (1991)). The three known active 
members in the raf family encode phosphoproteins of 
similar size (72/74 kD for Raf-1 ; 68 kD for A-Raf-1 , and 
74 kD for B-Raf (D.R. Rapp et al., in P^rnvi mses and 
25 Pathology. R. Gallo et al., Eds. (Humana Press, 

Clifton, New Jersey 1985), pp. 449-472; T.W. Beck et al., 
fiaslgi£ _^^ 15, 595 (1987); G. Sithanandam et al 
oncogene 5, 1775 (1990))). Raf-1 was first identified as 
the cellular homologue of v-raf (H.ff. Jansen et al., 
30 Nafcure 307, 218 (1984)), the transforming gene of 3611 MSV 
^X~Rapp et al., J, Virol, 45, 914 (1983); O.R. Rapp et 
al , rr~ Mfel »SA SO, 4218 (1983)). Amino 

acid comparisons of raf family genes shows three conserved 
regions [CM, CR2, CR3] (T.W. Beck et al., BaSlSiS Acids 
35 ReSj. 15, 595 (1987)); CR1 is a regulatory region 

surrounding a Cys finger consensus sequence, CR2 is a 
serine/threonine rich region, and CR3 represents the 



20 



WO 93/06248 



PCT/US92/07817 



-3- 

kinase domain. Raf-1 has been mapped to chromosome 3p25 
in humans (S.J. O'Brien et al., Science 223, 71 (1984)), 
and this region has been found to be frequently altered in 
small cell lung carcinoma (SCLC) (J. Whang-Peng et al., 
5 Pannar Gen*t :. Cvtooenefc. 6, 119 (1982); J.M. Ibson et al., 
■7. oil. Biochem. 33, 267 (1987)), familial renal cell 
carcinoma (A.J. Cohen et al., N- Engl. J. Med. 301, 592 
(1979); G. Kovacs et al., Ipt .T. Cancer 40, 171 (1987)), 
mixed parotid gland tumors (J. Mark et al., Hereditas 96, 
10 141 (1982)), and ovarian cancer (K. Tanaka et al., Cancer 
fanst. Cvtoaenet. 43, 1 (1989)). 

Raf genes are differentially expressed in 
various tissues (S.M. Storm et al., Oncogene, 5, 345 

15 (1990)). c -raf- 1 has been found to be expressed 

ubiquitously, though absolute levels vary between tissues. 
A -raf- 1 is present predominantly in the urogenital 
tissues, whereas B-Raf is most abundant in cerebrum and 
testis. The ubiquitous c-Baf-1 kinase is regulated by 

20 tyrosine and serine phosphorylations that result from 

activated growth factor receptor kinases (O.K. Morrison et 
al., Cell 58, 648 (1989); D.K. Morrison et al., Proc T 
Natl. Acad. Sci. USA 85, 8855 (1989); K.S. Kovacina et 
al., T- Biol. Chem, 265, 12115 (1990); P.J. Blackshear et 

25 al., .7. Biol. Chem. 265, 12131 (1990); M.P, Carroil et 

al., .7. Biol- Chem. 265, 19812 (1990); J.N. Siegel et al., 
.7. Biol. Chem. 265, 18472 (1990); B.C. Turner et al., 
Pr-nn. Natl. H**A. Sci. USA 88, 1227 (1991); M. Baccarini 
et al., embo J. 9, 3649 (1990); H. App et al., Mol. Cell. 

30 Biol. 11,-913 ( 1 991 ) ) . Raf-1 operates downstream of Ras 
in mitogen signal transduction as judged by experiments 
using antibody microinjection (M.R. Smith et al., Nature 
320, 540 (1986)), c-raf-1 antisense expression constructs 
(W. Kolch et al., Nature 349, 426 (1991)), dominant 

35 negative mutants (W. Kolch et al., Nature 349, 426 

(1991)), and Raf revertant cells. Studies with .NIH3T3 
cells and brain tissue demonstrated that mitogen treatment 



WO 93/06248 



PCT/US92/07817 



-4- 



10 



15 



20 



25 



30 



35 



induces Raf-1 kinase activity and causes a transitory 
relocation of the active enzyme from the cytoplasm to the 
nucleus and perinuclear area (Z. Olah et al., Kxp. Brain. 
ReS . (in press); U.R. Rapp et al., in fold Spring Harbor 
er , r ^,- a ™ Qua -"-*"™ Biology. Vol. LIII, Eds. (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
1988) pp. 173-184). 

Raf-1 coupling has been examined in more than a 
dozen receptor systems and all strong mitogens stimulated 
Raf-1 kinase activity (U.R. Rapp, Oncogene 6, 495 (1991); 
^. Morrison et al., Cell 58, 648 (1989); D.K. Morrison 
et al., rmr Nat] - PSA 85, 8855 (1989); K.S. 

Kovacina et al., .T Biol. Chem. 265, 12115 (1990); P.J. 
Blackshear et al., J Biol. Chem., 265, 12131 (1990); M.P. 

Carroll et al., J BlflL chem - 265 ' 19812 (1 "° ); J * N * 
Siegel et al., J Biol. Chem. 265, 18472 (1990); B.C. 
Turner et al., Vrnr Natl Acad. Sci. USA 88, 1227 (1991); 
M. Baccarini et al., S2SP_J. 3649 (1990); H. App et 
al., m„, sell, Biol. 11, 913 (1991)), and this 
stimulation correlated with an increase in E§X-1 
phosphorylation leading to a shift in apparent molecular 

weight. 

SUMMARY OF THE INVENTION 

It is an object of this invention to provide a 
method of identifying an individual at an increased risk 
for developing cancer. 

It is another object of this invention to 
provide a method for determining a prognosis in patients 
afflicted with cancer. 

It is a further object of this invention to 
provide a method for determining the proper course of 
treatment for a patient afflicted with cancer. 
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Further objects and advantages of the present 
- invention will be clear from the description that follows. 

In one embodiment, the present invention relates 
5 to a method of identifying an individual at an increased 
risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene; 
analyzing products of the amplification for evidence 
of mutation; and 
10 classifying an individual having one or more 

mutations in the region as having an increased risk for 
developing cancer. 

In another embodiment, the present invention 
15 relates to a method for determining a prognosis in 
patients afflicted with cancer, comprising: 

amplifying a region of the c-raf-1 gene; 
analyzing products of the amplification for evidence 
of mutation; and 
20 classifying patients having no mutation in said 

region as being less likely to suffer disease relapse or 
having an increased chance of survival than those patients 
having one or more mutations in said region. 

25 m a further embodiment, the present invention 

relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer, comprising: 
amplifying a region of the c-raf-1 gene; 
analyzing products of said amplification for evidence 

30 of mutation; 

identifying a patient having at least one mutation in 

said region, which patient may require treatment proper 

for patients having a lesser chance of survival or 

decreased time to relapse; and 
35 identifying a patient lacking mutations in said 

region, which patients may require treatment proper for 
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patients having a greater chance of survival or being less 
likely to suffer disease relapse. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 

Figure 1 . Effect of BHT promotion on ENO 
tumorigenesis in NFS/n x AKR mice. The X-axis represents 
percent tumor induced mortality within each group, and the 
y-axis reflects age in weeks. All animals were exposed to 

10 ENO transplacental^ at a dose of 0.5mM/Kg of mother's 
body weight on day 16 of gestation (presence of vaginal 
plug was scored as day one). At two weeks of age mice 
were weaned into two separate groups and separated by sex. 
Both groups were housed in identical cages and supplied 

15 with food (Purina Lab Chow) and water ad libitum. 

Beginning at three weeks of age, group 2A (0) was given 
weekly intraperitoneal (i.p.) injection of corn oil (0.1 
ml), and group 2B (o) received weekly i.p. injections of 
BHT (20 mg/Kg of body weight) dissolved in corn oil. 

20 Administration of BHT reduces the mean age of mortality 
from approximately 20 weeks to 13, and decreases the 
initial age of mortality. These curves are significantly 
different (p< 0.001) as judged by a 2-tailed Cox test. In 
both groups the rate of tumorigenesis was identical for 

25 males and females. 

Figure 2. Northern blot analysis of proto- 
oncogene expression levels in ENO induced tumors. 

Figure 3. Diagnostic digestion of PCR amplified 
Ki-ras genes from ENO induced tumors. Genomic DNA was 
isolated from a cesium chloride gradient during RNA 
preparations. In each case 10 ng was amplified via PCR 
(95-C, 5 min. followed by 35 cycles of 95°C, 1 mm. ■» 
55"C, 1 min. -» 72% 1 min.) with 2 units of Tag I 
polymerase. The primers used (K1 ; 5'- 
AACTTGTGGTGGTTGGACCT— 3 ' ■* (SEQ ID N0:6) and K2; <- 3'- 
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GTCTTAGTGAAACACCTACT— 5 ' (SEQ ID NO:7)) generate a 79 b.p. 
product. The primer Kl ends at codon 12 and contains a 
mismatch from normal mouse (Ki-£as. sequence its 18th 
nucleotide (G -* C) creating a BstNI site (CCTGG) in 
5 conjunction with a normal codon 12 (GGT) . Digestion of 
amplified product from a normal allele with BstNI produces 
two products of 19 and 60 b.p., whereas a mutation in one 
of the first two positions of codon 12 will eliminate the 
BstNI site. The presence of two normal alleles results in 

10 all of the product being cleaved and the presence of one 
mutant and one normal allele will result in only half of 
the product cut. In the three panels each sample was run 
in duplicate, uncut and cut with BstNI. F1 is DNA from an 
untreated NFS/n X AKR F1 mouse, and MCA5 is a murine cell 

15 line known to harbor a mutant K-ras codon 12 allele. One 
lymphoma (24Ly) and one cell line (117; derived from a 
lung adenocarcinoma) display a mutated Ki-ras codon 12 
allele; however, 24Ly was a passaged tumor and examination 
of the original tumor showed two normal alleles indicating 

20 that this mutation was acquired during passaging. 

Figure 4. c-raf-1 RNAse protection analysis of 
ENU induced tumors. The probe used was a 32E labeled 
antisense transcript from the 3' non-coding region of a 

25 mouse c-raf-1 cDNA to the 3' most StuI site. 

Hybridization of this probe with normal RNA results in a 
protected fragment of 1.2kb covering the region encoding 
the Raf-1 kinase domain. One ug of poly (A) + RNA from each 
tumor and 5 ug of F1 RNA (in order to get comparable 

30 signals) was hybridized for 12 hours at 52'C with 200,000 
cpm of *P labeled mouse c-£§f. antisense transcript. 
Hybrids were then digested for 30 minutes with 25 ug RNAse 
A and 33 units of RNAse T1 at room temperature. Digested 
hybrids were then incubated with 50 ug of proteinase K, 

35 phenol/chloroform extracted, ethanol precipitated, and 
resuspended in loading dye containing 80% formamide. 
Samples were then run on 6% polyacrylamide denaturing 
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sequencing gels at 65 watts. Gels were vacuum dried at 80 
degrees C and exposed to X-ray film. Probe is -dxgested 
probe alone; tRNA is probe hybridized to non-specific RNA, 
v-raf is probe hybridized to RNA from a v-raf transformed 
5 cell line and the bands detected represent singlebase 
mismatches between murine c-^f and v-r*l; NES/AKR Fl i 
probe hybridized with RNA from a normal (untreated) mouse, 
24 LY is probe hybridized with RNA from a lymphoma; and 
the remaining lanes are probe hybridized with RNA xsolated 
10 from lung tumors* 

Figure 5. Schematic of Raf-1 protein showing 
sites of END induced mutations. CR1 , CR2, and CR3 
represent conserved regions 1, 2 and 3. cDNAs were made 

15 from tumor derived poly (A) + RNA using MoMuLV reverse 
transcriptase. Primers (MR1 sequence and MR2 -quence) 
encompassing a 435 base pair region c-^af were then used 
to amplify this region via PGR. The amplification mature 
was tnen run on 1.7% agarose gels and the 435 bp product 

20 Isolated. This isolated fragment was then treated w th T4 
polymerase and cloned into the Hindi sxte of Ml3mp18 for 
sequencing. Another set of primers (EMR1 sequence and 
EMR2 sequence) was designed containing EcoRI sxtes at the 
termini and used to amplify a 609 base pair regxon 

25 (encompassing the original 435 base pair region) • 

isolated products from these reactions were then dxgested 
with EcoRI and cloned into the EcoRI site of KS. 
Sequencing reactions were carried out using the Sequenase 
kit (OSB) according to the recommended protocols for 

30 single and double stranded sequencing. Sequencing 

reactions were run on 6% polyacrylamide denaturxng gels at 
65 watts. Gels were vacuum dried at 80 degrees c and 
exposed to X-ray film. In each case a normal allele was 
also sequenced along with the mutant allele. 



35 
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Figure 6- Schematic for Identifying c-raf-1 
mutations. Primers 1 and 2 are shown in SEQ ID NO: 8 and 
SEQ ID NO: 9, respectively. 

5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to methods that 
involve amplifying a region of the c-raf-1 gene (the 
sequence of a mouse c-raf-1 gene is shown in SEQ ID NO: 10; 
10 the nucleotide and corresponding amino acid sequence of a 
human c-raf-1 gene is shown in SEQ ID NO: 11 and SEQ ID 
NO:12 f respectively). 

In one embodiment, the present invention relates 

15 to a method of identifying an individual at an increased 
risk for developing cancer (preferably, lung cancer, T- 
cell lymphomas, renal cell carcinoma, ovarian carcinoma, 
and mixed parotid gland tumors) comprising: amplifying a 
region (preferably by using the polymerase chain reaction 

20 method (PCR) or by cloning techniques) of the c-raf-1 gene 
of the individual (SEQ ID NO:11)(in one preferred 
embodiment, the region encodes amino acids 514 to 535 of 
SEQ ID NO: 12); analyzing products of the amplification for 
evidence of mutation (preferably by DNA sequencing of the 

25 region) and classifying an individual having one or more 
mutations in the region as having an increased risk for 
developing cancer. In one prefered embodiment, the region 
encodes amino acids 500 to 550 of SEQ ID NO: 12 or amino 
acids 450 to 630 of SEQ ID NO: 12. In another prefered 

30 embodiment, the PGR method employs a primer comprising the 
sequence shown in SEQ ID NO: 7 and a primer comprising the 
sequence shown in SEQ ID NO: 8. In another prefered 
embodiment, the method comprises the steps shown in Figure 

35 in another embodiment, the present invention 

relates to a method for determining a prognosis in a 
patient afflicted with cancer (preferably, those cancers 
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listed above). The method comprises: amplifying the 
region of the c-raf-1 gene as described above; analyzing 
products of the amplification for evidence of mutation as 
described above; and classifying a patient having no 
mutation in the region as being less likely to suffer 
disease relapse or having an increased chance of survival 
than a patient having one or more mutations in the region. 

In another embodiment, the present invention 
relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer (preferably, 
those cancers listed above), comprising: amplifying a 
region (described above) of the c-raf-1 gene as described 
above; analyzing products of the amplification for 
evidence of mutation as described above; identifying a 
patient having at least one mutation in the region, which 
patient may require treatment proper for patients having a 
lesser chance of survival or decreased time to relapse; 
and identifying a patient lacking mutations in the region, 
which patients may require treatment proper for patients 
having a greater chance of survival or being less likely 
to suffer disease relapse. 

Administration of therapeutic agents (cytotoxic 
or cytostatic) tailored to recognize the mutant Raf-1 
protein but not normal Raf-1 could specifically target 
tumor cells for death of growth inhibition. Such agents 
could be comprised of cytotoxic T-cells, antibodies, 
and/or specifically designed chemical compounds. 

The following Examples demonstrate consistent 
point mutations of the c-raf-1 proto-oncogene, within a 
small region of the kinase domain, in a mouse model for 
chemical tumor induction. This is the first demonstration 
of point mutated raf genes in vivo, and the first 
isolation of activating ia yiyo. point mutations in the 
kinase domain of a proto-oncogene. The tumors examined 
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show a selective specificity for Raf-1 mutations as 
another family of genes, the ras proto-oncogenes which are 
frequently activated by point mutation in both animal and 
human tumors (S. Rodenhuis et al., Am. Rev. Respir. Pis. 
142, S27-30; T.R. Devereux et al., Carcinogenesis 12, 299 
(1991)), is not involved. 

The present invention is described in further 
detail in the following non-limiting examples. 

EXAMPLES 

The following protocols and experimental details 
are referenced in the examples that follow: 

15 RNA Isolation. Tumors were excised, a small portion 
minced in PBS (phosphate buffered saline solution) for 
passaging in nude mice, frozen immediately in a dry 
ice/ethanol bath, and stored at -70° until RNA extraction. 
Frozen tissues were minced on wet ice in a guanidine 

20 thiocyanate buffer (4M guanidine thiocyanate 10mM EDTA, 2% 
N-lauryl sarcosine, 2% beta-mercaptoethanol, 10mM Tris 
(pH=7.6)), disrupted in a Dounce homogenizer, and 
extracted three times with phenol: chloroform: isoarayl 
alcohol (24:24:2). Supernatants were then transferred to 

25 SW41 tubes, 100 >ig of cesium chloride per ml was added to 
the supernatant which was then underlayed with one half 
saturated cesium chloride in 10mM EDTA (pH=7-0; index of 
refraction 1.3995-1.4000), and centrifuged at 25,000 rpm 
for 20 hours in a Sorvall SW-41TI rotor using a Beckman 

30 model L5-50 ultracentrifuge. Supernatants were removed 
and RNA pellets dissolved in 4 ml resuspension buffer ( 1 0 
mM Tris-HCl pH=7.6, 5% beta-mercaptoethanol, 0.5% N-lauryl 
sarcosine, 10 mM EDTA), extracted once with 
phenol: chloroform :isoamyl alcohol, sodium acetate added to 

35 0.1 2M and RNA precipatated with two volumes ethanol at - 
20°C overnight. Precipitates were centrifuged at 9,000 rpm 
in a Sorvall SS-34 rotor for 30 minutes, and pellets 
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redissolved in RNA sample buffer (10 mM Tris P H=7.4, ImM 
EDTA, 0.05% sodium dodecyl sulfate) and concentrations 
determined by absorbance at 260 nm. Poly (A)* RNA was 
isolated by binding to oligo dT cellulose columns in high 
salt (10mM Tris P H=7.4, I* EDTA, 0.05% SDS, SOOrnM NaCi), 
and eluting with RNA sample buffer heated to 40 C. 

-^■wn ttlnfctina. 5 ug PolyU)' RNA per lane was ethanol 
precipitated, desiccated, resuspended in loading buffer 
(20mM MOPS P H=6.8, 5mM sodium acetate, ImM EDTA, 50% 
formamide, 6% formaldehyde), heated at 65°C for 5 min., 
guick chilled on wet ice for 10 min., and electrophoresed 
through a 0.7% agarose gel containing 2.2 M formaldehyde, 
20mM MOPS [pH=6.8], 5mM sodium acetate, and ImM EDTA. 
Gels were then blotted overnight onto nitrocellulose 
filters via capillary transfer in 20X SSC, filters were 
washed in 3X SSC for 10 min. and baked at 80°C for 2 
hours . 

actions. Filters were prehybridized at 42'C in 5X 
SSC, 50% formamide, 20mM sodium phosphate pH=6.8, 200 
ug/ml PVP-40, 200 ug/ml ficoll 400, 200 ug/ml bovine serum 
albumin, and 200 ug/ml sonicated sheared salmon sperm DMA. 
Blots were. then hybridized with 500,000 cpm/ml of random 
primed *P labeled probes overnight at 42°C in 
prehybridization solution with 5% dextran sulfate. Blots 
were washed with agitation in 2X SSC, 0.1% SDS at room 
temperature six times for 20 minutes each wash, then 
washed once at 45°C in 0.1X-SSC for 15 minutes. Fxlters 
were exposed to X-AR 5 film at -70°C. 

SAMPLE 1 
Tumor I nduction 

NFS female mice were mated with AKR males and 
pregnant females given a transplacental injection of 1- 
ethyl-1-nitrosourea (END) at a dosage of 0.5 mM/Kg 
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mother's body weight on day 16 of gestation, counting plug 
date as day one. ENU was chosen for tumor induction since 
it is a very potent direct acting carcinogen capable of 
modifying any base in vivo (Singer, B. et al., 1983. 
5 Molecular Biology of Mutagen s and Carcinogens, Plenum 
Press, New York). ENU alkylates all tissues with roughly 
the same efficiency (E. Scherer et al., Cancer Lett. 46, 
21 (1989)) and has a very short half life in vivo (E.M. 
Faustman et al., Teratology. 40,. 199 (1989)) allowing 
10 specific mutagenesis of tissues which are mitotically 

active at a particular time. NFS and AKR were chosen as 
parental strains based on earlier studies which showed 
them to be particularly susceptible to ling tumors 
following ENU exposure (B.A. Diwan et al., Cancer Res. 34, 
15 764 (1974); S.L. Kauffman, JNCI 57, 821 (1976)). With 
this procedure nearly 100% of the offspring develop lung 
adenocarcinomas and approximately 70% develop, in 
addition, T-cell lymphomas with a mean latency of 
approximately 20 weeks. In order to achieve more rapid 
tumor development, weanling mice were treated with weekly 
injections of a tumor promoter, the antioxidant butylated 
hydroxy toluene or BHT (20mg/kg body weight dissolved in 
corn oil). BHT was used as it has been demonstrated to 
cause lung, lesions, and hyperplasia when injected into mice 
(A. A. Marino et al., »™r soc. Fxn, Biol. Med. 140, 122 
(1972); H. Witschi et al., Proc. Soc f Fxp. Biol. Med. , 
147, 690 (1974); N. Ito et al., CRC Crit. Rev. Toxicol. 
15, 109 (1084)). in the present system it nearly doubles 
the rate at which tumors develop. Figure 1 compares tumor 
induced mortality with age of animals for those receiving 
ENU alone, and those receiving ENU and promoted with BHT. 
These curves demonstrate that when BHT is given the mean 
age of tumor induced mortality decreases from 
approximately 20 weeks to around 12, and there is also a 
decrease in initial latency. These curves are 
significantly different with a confidence limit greater 
than 99.99% using a 2-tailed Cox test. In addition, BHT 
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promotion, while increasing the rate at which tumors 
develop, does not affect the tumor spectrum. 
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KXAMPLE 2 
nnnoaen" Repression 



Northern blot analysis revealed elevated levels 
of c-raf-1, as compared to normal tissue, in every tumor 
examined (Figure 2), and Western blot analysis showed that 

10 protein levels correlated with message levels (U.R. Rapp 
et al., in QncgggaM and Cancer, S.A. Aaronson et ml.. 
Eds. (Tokyo/VNU Scientific Press, Tokyo, 1987) pp. 55-74). 
in addition, in cell lines derived from primary tumors 
Raf-1 protein kinase activity was shown by immune-complex 

15 kinase assays to be constitutive. Further analysis of 
other oncogenes revealed no consistent pattern of 
expression except for ras and myc family genes. In the 
case of the myc family, one member (either c-, N-, or L- 
mvc) was overexpressed but never more than one. For the 

20 ras genes, at least one member (Ki-, Ha-, or N-ras.) and 
o^en more than one, was expressed at high levels when 
compared with the normal tissue. In addition all 
oncogenes examined via Northern analysis exhibited full 
length, normal sized transcripts. 

ras. genes were considered likely candidates for 
mu tational activation since oncogenic forms of Ki-ras have 
previously been observed in lung tumors (S. Rodenhuxs et 
al., m — gs§Ei£j 1«. S27-30; «... Devereux et 

a! , ^Hnnaenesis 12, 299 (1991)) and END is a point 
mutagen (Singer, B. et al. , 1983. Modular Biology of 
^-t-- ™« r^nocens. Plenum Press," New York). A 
systematic analysis of various ras codons known to be 
involved in oncogenic activation was therefore performed . 
Ha- Ki-, and N— ras were examined at codons 12, 13, and 
61 for potential mutations via RNAse protection assays 
(R.M. Myers et al., S^n- 230, 1242 (1985); E. Winter et 
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al., PT-or Natl. Sr.! . USA 82, 7575 (1985)), PCR 

amplification followed by subsequent sequencing (F. Sanger 
et al., -t MoT . Biol. 13, 373 (1975)), and PCR 
amplification followed by diagnostic restriction digests 
5 (W. Jiang et al-, oncogene 4, 923 (1989)). PCR 

amplification creating diagnostic enzyme sites is a very 
efficient way of examining alleles for mutations at known 
sites and involves designing a PCR primer whose 3» end 
lies next to and produces a novel restriction site 
10 encompassing the codon of interest. Following 

amplification, PCR products from normal alleles will 
contain the new restriction site, while mutant alleles 
will not. Digestion of the product from tissue with two 
normal alleles results in all product being cut; however, 
15 if one allele contains a mutation, only half of the 

product will be digested. Figure 3 shows the results of 
amplification and diagnostic digestion applied to Ki-£as 
codon 12 in several tumors and cell lines. The first 
panel is from a set of lymphomas. F1 is DNA from a normal 
20 untreated mouse and both alleles are cut by BstNl, 

indicating the presence of two normal alleles. MCA5 is a 
murine cell line known to contain a Ki-£2£ codon 12 
mutation (L.F. Parada et al., MqI , Cell. Biol. 3, 2298 
(1983)), and only the amplified normal allele is cleaved. 
25 Of the five tumors shown in the second panel, one shows a 
mutant Ki-ras allele. The next panel shows some of the 
lung tumors tested and none of them exhibit a mutant 
allele, and the final panel shows tumor derived cell 
lines. The first three are from lymphomas and the last 
30 three from lung adenocarcinomas. One lung tumor line 
<0117) has a Ki-ras. 12 mutation that was not present in 
the primary tumor but came up upon transplantation. This 
analysis has been performed with Ki, Ha and N-ras genes at 
codons T 2 and 61 . Of all the tumors and cell lines 
35 examined by this method for mutations of the three ras 
genes at codons 12 and 61, the two shown here were the 
only ones detected. Examination of codon 13 was done by 
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PCR amplification of genomic DNA surrounding codon 13 
followed by cloning into KS + (Stratagene) and double 
stranded sequencing. Table I summarizes the ras mutation 
data. The most notable point from this table is the 
5 conspicuous lack of ras mutations in these tumors. In 
fact the number of ras. mutations is much lower than would 
be expected for a sampling of spontaneous tumors (S. 
Rodenhuis et al., M *espir. 142 ' S27 " 30? T ' R ' 

Devereux et al., C^rcinpgenggls 12, 299 (1991); J.L. Bos, 
10 r^r Res. 49, 4682 (1989)). Having eliminated ras genes 
as playing a primary role in the genesis of these ENU 
induced tumors, c-raf-1 was investigated for possible 
small or point mutations. 

15 TABLE I 

Tumors, and Cell Line - Positive for ras Mutations 
Codon 12 Codon 13 Codon 61 



20 Ha-ras 
Ki/ras 
N-ras 



Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


0/10 


0/6 


0/6 


0/2 


0/10 


0/6 


i7io 


1/6 


0/6 


0/2 


0/6 


0/2 


0/10 


0/6 


0/6 


0/2 


0/10 


0/6 



• This was a second passage tumor in which the original 
25 tumor did not contain a Ki-ras mutation. 

Table I: Summary of mutation analysis for Ha-, Ki-, and 
N-ras at codons 12, 13, and 61. Each box displays the 
number of mutations detected, over the number of tumors 
30 and tumor derived cell lines examined via RNAse 

protection, sequencing or diagnostic digestion, for each 
of the nine codons. 

TCYAMPLE 3 

35 H"^ f4 " ng in Raf " 1 

Since no point mutations had been described 
for a£ genes in vivo, as had been for the ras genes (E. 
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Santos et al., Science 223, 661 (1984); S. Rodenhuis, L 
Engl. J. Med. 317, 929 (1987); M. Barbacid, Eur. J. 
run, invest. 20, 225 (1990); F. Sanger et al., J f Mol. 
Biol. 13, 373 (1975)), point mutations were screened for 

5 using RNAse protection assays (R.M. Myers et al., 

Science 230, 1242 (1985); E. Winter et al., Proc. Natl. 
Acad. Sci. USA 82, 7575 (1985)). Figure 4 shows a 
typical protection assay using a c-raf-1 probe. In this 
experiment the probe used covered the 3' end of raf-1 

10 from the 3' most StuI site to the end of the coding 

sequence. The first lane is a marker (pBR322 digested 
with Haelll), the second shows the probe alone 
undigested, the third lane shows the probe hybridized to 
unrelated RNA in this case tRNA, the fourth lane shows 

15 hybridization with v-raf transformed cells and the lower 
bands represent cleavage at points where the mouse c- 
raf-1 gene differs from v-raf. The fifth lane shows 
hybridization with RNA isolated from a normal lung of an 
untreated F1 mouse, the next lanes are RNA isolated from 

20 several tumors. In the case of the normal RNA, only 

one, fully protected, band is detected while in the case 
of the tumors two major bands are seen after digestion. 
20 out of 20 tumors analyzed in this fashion showed this 
extra band. These data demonstrate the following major 

25 points: 1) there is a tumor specific alteration in c- 
ra f- 1 that results in a region of non-homology 
recognizable by either RNAse A or T1 ; 2) The alterations 
are confined to the same region of one allele as two 
bands of equal size are present in the tumor lanes, and; 

30 3) both alleles were expressed at comparable levels as 
both bands are of approximately equal intensity. In the 
assay shown 5 ug of poly(A) + RNA was hybridized from 
normal tissue, and 1 ug was used from the tumors. This 
was necessary to get signals that could be compared on 

35 the same gel due to the overexpression of c-£§f-1 in the 
tumors. By running these assays with various, markers it 
was possible to estimate the approximate site of the 
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alteration(s) to be in the vicinity of the exon 14 exon 
15 junction. In order to define the precise genetic 
alteration or alterations, PGR primers were designed 
which would generate a 600 bp fragment encompassxng thxs 
5 region. cDNAs fro. tumor derived RNA were then 

amplified and cloned into KS + (Stratagene) for double 
stranded sequencing. The sequencing results from 
several tumors are shown in Figure 5. The top portxon 
of Figure 5 presents a cartoon of the mouse Raf-1 
l0 protein! There are three conserved regions GR1 «- 
CR3 with CR3 representing the kinase domain. The probe 
used in the RNAse protection assays covers the xndxcated 
area, and the PGR primers amplified the bracketed 
^gion. Sequencing through this area revealed a varxety 
15 of mutations just downstream of the APE sxte These 

mutants are shown in an expanded version at the bottom 
of Figure 5 (See also SEQ ID N0:1 for normal mouse 
sequence and SEQ ID N0:2. 3, 4, and 5 for mutant 
sequences). These mutants were isolated from four 
separate tumors, and in each case a normal allele (SEQ 
ID NO:1) was also sequenced. Repeating the cDNA 
synthesis, PGR amplification, cloning and sequencxng 
gives the same sequence and normal tissue shows no 
mutations demonstrating that these alterations are not 
artifactual. Sequence covering the amplified region has 
been examined and it is interesting that all of these 
changes occur within a very small region of the raf 
^lin. In fact the region where these mutations occur 
overlaps an epitope shared by monoclonal antxbodies 
generated against raf (W. Kolch et al. , ^ 
O990», -d computer modeling of tbe protexn shows this 
to be a hydrophilic domain, the structure of which xs 
predicted to be altered by these mutations. This 
indicates a biologically important region for the 
molecule and indeed the first of these mutatxon tested 
in NIH3T3 cell assays, after cloning into a retroviral 
expression vector (E1-neo, (G. Heidecker et al., MbU 
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Cell. Biol. 10, 2503 (1990))), was found to be weakly 
transforming when driven by a Moloney LTR. The 
transformation efficiency was comparable to EC2, a 
previously characterized mutation of human c -raf- 1 cDNA 
5 (G. Heidecker et al., Mol. Cell. Biol. 10, 2503 (1990); 
C. Wasylyk et al., Mol. Cell. Bior. 9, 2247 (1989)) and 
-20 fold lower than the v-raf oncogene. 



* * * * * 
10 All publications mentioned hereinabove are 

hereby incorporated in their entirety by reference. 

While the foregoing invention has been 
described in some detail for purposes of clarity and 
15 understanding, it will be appreciated by one skilled in 
the art from a reading of this disclosure that various 
changes in form and detail can be made without departing 
from the true scope of the invention and appended 
claims . 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Rapp, Ulf R. 

Storm, Stephen M. 

(ii) TITLE OF INVENTION: DETECTION METHOD FOR C-RAF-1 GENES 
(iii) NUMBER OF SEQUENCES: 12 

ttvl CORRESPONDENCE ADDRESS: 

( ' (A) ADDRESSEE: NEEDLE & ROSENBERG, P.C. 

(B) STREET: Suite 400, 133 Carnegie Way, N.W. 

(C) CITY: Atlanta 

(D) STATE: Georgia 

(E) COUNTRY: USA 

(F) ZIP: 30303-1031 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C OPERATING SYSTEM: PC-JOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vi ii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Perryman, David G. 

(B) REGISTRATION NUMBER: 33,438 

(C) REFERENCE/DOCKET NUMBER: 1414.0421 

fix) TELECOMMUNICATION INFORMATION: 
1 ' (A) TELEPHONE: (404) 688-0770 
(B) TELEFAX: (404) 688-9880 

(2) INFORMATION FOR SEQ ID N0:1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 1U 
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Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 * 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 " 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
r 115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 . 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
180 185 190 

leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 " 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
P 275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
2go 295 300 
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Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 3 JU 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 

340 345 
Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 

355 360 
His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

370 375 3 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 3 9° J * 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys tap Asn 
405 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 w 



Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 

450 455 
lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 

500 ^ 
Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 

515 520 
Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 

530 535 
Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 



545 
Val 



550 



Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 



565 



»s Pro l» AU Met I* An, Leu Val Ale Asp Cys Val lys Lys V.1 
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Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser lie Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
I 5 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 * 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys AU Arg Leu 
100 105 HQ 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
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Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 I 85 

Leu Leu Leu Phe Pro Asn Ser Thr Vat Gly Asp Ser Gly Val Pro Ala 
195 200 " 3 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 UK) 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 £3vJ *• 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 

Ala lie Arg Ser His Ser 
275 280 285 



260 265 
Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 



Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 JUU 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 

Thr Val Tyr 
355 360 365 



340 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 



His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

370 375 
Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 qvJ 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 * a 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 wa 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn lie 



450 
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Ile His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu 61y Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val Val Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 648 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys lie Ser Pro Thr He 

• 20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 
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Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 

55 0 



50 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 



100 



Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 

115 1ZU 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 

130 * 35 lW 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 



130 135 140 

Leu Lys Leu Ala Phe Cys Asp He 
145 150 I 55 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 

180 I 85 
Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 

210 215 " u 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 

Asn Thr Ser Ser Pro. Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 



245 



Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 

260 265 
Asp Ser Ag Met n. «> Asp AU He Arg Ser H. |er GU Ser Ala 



275 



Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 

290 295 
Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 J " 

fly Ser Gly Thr Gin Gin lys Asn lys lie Arg Pm Ar, Gly Gin Arg 
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Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu lie Asp lie 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
3 595 3 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 
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Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 648 amino acids 
B) TYPE: amino acid 



!S1 



TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 

1 ^ 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 H3 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 

50 55 
Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 



65 



Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 

100 105 
Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 

115 120 
Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 

130 135 
Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 x ^ 

to. ay Phe Ar, Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
ion 1" 



180 

195 



Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
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Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met lie Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys . Val Val Asp Pro Thr Pro 
370 .375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
3 405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 
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Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 

Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Ser Gin 
51 5 520 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 a" 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 

610 615 
Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 63° 635 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ 10 N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 . 10 19 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 



20 



Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 4J 



Thr Asp Ser Ser Lys Thr Ser Asn Thr lie Arg Val Phe Leu Pro Asn 

55 



50 



Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 * 70 
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Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
U5 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 1« 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys " 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 190 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 . 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg GTn Arg 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 Z70 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 w 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 «l) 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 
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His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn lie Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala lie Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys lie Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 5>i u 

Ala Pro Glu Val lie Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Thr Cys Thr Phe Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 5>»° 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
5g5 600 505 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 515 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 
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(2) INFORMATION FOR SEQ ID N0:6: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

AACTTGTGGT GGTTGGACCT 20 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(Bl TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
TCATCCACAA AGTGATTCTG 20 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: . 

AGGAGACCAA GTTTCAGATG 20 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GCGTGCAAGC ATTGATATCC 20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1947 base pairs 
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(B) TYPE: nucleic add 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGAGCACA 


TACAGGGAGC 


TTGGAAGACfa 


ATTAErAATG 


CCTTTGGACT 


CAAAGATGCG 


60 


GTGTTTGATG 


GCTCCAGCTG 


A T/*Tf»r P PT 

CATCTCCCCT 


APPATTPTTP 


AGCAGTTTGG 


CTATCAGCGC 


120 


CGGGCCTCAG 


ATGATGGCAA 


GCTCACGGA1 


TPTTrTAAGA 


CAAGCAATAC 


TATCCGGGTT 


180 


TTCTTGCCGA 


ATAAGCAAAG 


P A PTPTPPTP 

GACTGT6G1 u 


AATGTGfGGA 


ATGGAATGAG 

fy i VJ 1 w*^w; 


CTTACATGAC 


240 


TGCCTTATGA 


AAGCTCTGAA 


r*r*Tr* a/* A ppp 

GGTGAGAGGC 


L I uLMb L. Lrtw 


AGTGCTGTGC 

Au 1 WW 1 U 1 Uw 


AGTGTTCAGA 


300 


CTTCTCCAGG 


AACACAAAGG 


TAAGAAAGCA 


rrrTTAGATT 

Lull lAuHl 1 


GGAACACCGA 


TGCCGCCTCT 


360 


CTGATTGGAG 


AAGAACTGCA 


AGTGGAl 1 1 i 


1 I buA 1 Urt I a 


TTCCCATCAC 

t 1 WWW-JM vnv* 


AACTCACAAC 


420 


TTTGCTCGGA 


AAACGTTCCT 


GAAGCTTGCA 


1 J L 1 u 1 uALM 


TPTGTCAGAA 


GTTCCTGCTA 


480 


AATGGATTTC 


GATGTCAGAC 


TTGTGGCTAC 


A APTTTPATP 

AAb 1 1 I uh i a 


AGCACTGTAG 


CACCAAAGTA 


540 


CCTACTATGT 


GTGTGGACTG 


/»A/*TA AT ATP 

GA6TAAI Alu 


agapagptct 

AuALAuL 1 t- I 


TGCTGTTTCC 

I Uv 1 W 111 w 


AAATTCCACT 


600 


GTTGGTGACA 


GTGGAGTCCC 


a A pp A l»pT 


TpTTTrrrAA 

1 u 1 1 1 w»wv\ 


TGCGTCGGAT 


GCGAGAATCT 


660 


GTTTCCCGGA 


T6CCTGCTAG 


TTr* , /» A PP A P 

TTCCCA6CAL 


ACATAfTfXA 


CACCCCATGC 

un«M w w* » ■ w. w 


CTTCACTTTC 


720 


AACACCTCCA 


6CCCTTCCTC 


AGAAGGTTCC 


rrrTrrrACA 


GGCAGAGGTC 


AACGTCCACT 


780 


CCCAATGTCC 


ACATGGTCAG 


CACCACCCTG 


rATCTCGAPA 


GCAGGATGAT 


TGAGGATGCA 


840 


ATTCGAAGTC 


ACAGTGAATC 


AGCCTCACCl 


I LAu LLL 1 U 1 


CCAGCAGCCC 

UVTtvl www 


AAACAACCTG 


900 


GGTCCAACAG 


GCTGGTCACA 


a/km m a a a App 

GCCCAAAACC 


LLLu 1 aUUMu 


CACAAAGAGA 


GCGGGCACCA 


960 


GGATCTGGGA 


CCCAGCAAAA 


A A A f*A A A ATT 

AAACAAAAI 1 


AP.CPPTPGTG 
nuuLW i LU i u 


GGCAGAGAGA 


CTCGAGTTAT 


1020 


TACTGGGAAA 


TAGAAGCCAG 


TGAGGTGATG 


PTPTPTfl TTP 


GGATCGGGTC 

QUA 1 LUUu 1 w 


AGGTTCCTTT 


1080 


GGCACTGTGT 


ACAAGGGCAA 


GTGGCATGGA 


GATGTTGCAG 


TAAAGATCCT 


AAAGGTGGTT 


1140 


GACCCAACTC 


CAGAGCAACT 


TCAGGCCTTC 


AGGAACGAGG 


TGGCTGTTTT 


GCGCAAAACA 


1200 


CGGCATGTTA 


ACATCCTGCT 


GTTCATGGGG 


TACATGACAA 


AGGACAACCT 


GGCGATTGTG 


1260 


ACTCAGT6GT 


GTGAAGGCAG 


CAGTCTCTAC 


AAACACCTGC 


ATGTCCAGGA 


GACCAAATTC 


1320 


CAGATGTTCC 


AGCTAATTGA 


CATTGCCCGA 


CAGACAGCTC 


AGGGAATGGA 


CTATTTGCAT 


1380 


GCAAAGAACA 


TCATCCACAG 


AGACATGAAA 


TCCAACAATA 


TATTTCTCCA 


TGAAGGCCTC 


1440 


ACGGTGAAAA 


TTGGAGATTT 


T6GTTTGGCA 


ACAGTGAAGT 


CACGCTGGAG 


TGGTTCTCAG 


1500 


CAGGTTGAAC 


A6CCCACTGG 


CTCTGTGCTG 


TGGATGGCCC 


CAGAAGTAAT 


CCGGATGCAG 


1560 



WO 93/06248 



PCT/US92/07817 



-35- 

GATGACAACC CGTTCAGCTT CCAGTCCGAC GTGTACTCGT ACGGCATCGT GCTGTACGAG 1620 

CTGATGGCTG GGGAGCTTCC CTACGCCCAC ATCAACAACC GAGACCAGAT CATCTTCATG 1680 

GTAGGCCGTG GGTATGCATC CCCTGATCTC AGCAGGCTCT ACAAGAACTG CCCCAAGGCA 1740 

ATGAAGAGGT TGGTGGCTGA CTGTGTGAAG AAAGTCAAAG AAGAGAGACC TTTGTTTCCC 1800 

CAGATCCTGT CTTCCATCGA GCTGCTTCAG CACTCTCTGC CGAAAATCAA CAGGAGCGCC 1860 

TCTGAGCCTT CCCTGCATCG GGCAGCTCAC ACTGAGGACA TCAATGCTTG CACGCTGACT 1920 

ACATCCCCAA 6GCTACCAGT CTTCTAG . 1947 

(2) INFORMATION FOR SEQ 10 NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1947 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 
(0) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1.-1944 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG GAG CAC ATA CAG GGA GCT TGG AAG ACG ATC AGC AAT GGT TTT GGA 48 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

TTC AAA GAT GCC GTG TTT GAT GGC TCC AGC TGC ATC TCT CCT ACA ATA 96 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

GTT CAG CAG TTT GGC TAT CAG CGC CGG GCA TCA GAT GAT GGC AAA CTC 144 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

ACA GAT CCT TCT AAG ACA AGC AAC ACT ATC CGT GTT TTC TTG CCG AAC 192 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

AAG CAA AGA ACA GTG GTC AAT GTG CGA AAT GGA ATG AGC TTG CAT GAC 240 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 
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TGC CTT 
Cys Leu 

GCA GTG 
Ala Val 

GAT TGG 
Asp Trp 



ATG AAA 
Met Lys 
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GCA CTC AAG GTG AGG GGC 



Ala Leu Lys 
85 



Val Arg Gly 
90 



CTG CAA CCA GAG TGC TGT 288 

Leu Gin Pro Glu Cys Cys 
95 



TTC AGA CTT CTC CAC 

Phe Arg Leu Leu His 
100 

AAT ACT GAT GCT GCG 
Asp Ala Ala 



GAA CAC AAA GGT AAA AAA GCA CGC TTA 336 
Gly Lys 



Asn Thr 
115 



Glu His Lys 
105 



TCT TTG ATT GGA GAA 
Gly Glu 



Ser Leu He 
120 



Lys Ala Arg Leu 
110 

GAA CTT CAA GTA 384 

Glu Leu Gin Val 
125 



GAT TTC CTG GAT 

Asp Phe Leu Asp 
130 

ACG TTC CTG AAG 
Leu Lys 



Thr Phe 
145 

AAT GGA 
Asn Gly 

AGC ACC 
Ser Thr 

CTC TTA 
Leu Leu 



CAT GTT CCC CTC ACA ACA 

His Val Pro Leu Thr Thr 
135 

CTT GCC TTC TGT GAC ATC 

Leu Ala Phe Cys Asp He 
150 



CAC AAC TTT GCT CGG AAG 432 

His Asn Phe Ala Arg Lys 
140 

TGT CAG AAA TTC CTG CTC 480 
Lys Phe 



Cys Gin 
155 



Leu Leu 
160 



TTT CGA 
Phe Arg 



TGT CAG ACT TGT GGC TAC AAA TTT 

Lys Phe 



Cys Gin Thr 
165 



AAA GTA CCT ACT ATG 

Lys Val Pro Thr Met 
180 

TTG TTT CCA AAT TCC 
Pro Asn Ser 



Leu Phe 
195 



Cys Gly Tyr 
170 



TGT GTG GAC TGG AGT 

Cys Val Asp Trp Ser 
185 

ACT ATT GGT GAT AGT 
Asp Ser 



Thr He Gly 
200 



CAT GAG CAC TGT 528 

His Glu His Cys 
175 

AAC ATC AGA CAA 576 

Asn He Arg Gin 
190 

GGA GTC CCA GCA 624 
Pro Ala 



Gly Val 
205 



CTA CCT TCT TTG 

Leu Pro Ser Leu 
210 

CCT GTT AGT TCT 
Ser Ser 



ACT ATG CGT CGT ATG CGA 

Thr Met Arg Arg Met Arg 
215 



GAG TCT GTT TCC 

Glu Ser Val Ser 
220 



AGG ATG 672 
Arg Met 



Pro Val 
225 

AAC ACC 
Asn Thr 



CAG CAC AGA TAT TCT ACA CCT CAC GCC TTC ACC TTT 720 
Tyr Ser Thr 



Gin His Arg 
230 



Pro His 
235 



Ala Phe Thr Phe 
240 



TCC AGT 
Ser Ser 



CCC TCA TCT GAA GGT TCC CTC TCC CAG AGG CAG AGG 768 

Leu Ser 



Pro Ser Ser 
245 



Glu Gly Ser 
250 



Gin Arg Gin Arg 
255 



TCG ACA TCC ACA CCT AAT GTC CAC ATG GTC AGC ACC ACG CTG CCT GTG 816 
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Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 
260 265 270 

GAC AGC AGG ATG ATT GAG GAT GCA ATT CGA AGT CAC AGC GAA TCA GCC 864 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

TCA CCT TCA GCC CTG TCC AGT AGC CCC AAC AAT CTG AGC CCA ACA GGC 912 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

TGG TCA CAG CCG AAA ACC CCC GTG CCA GCA CAA AGA GAG CGG GCA CCA 960 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

GTA TCT GGG ACC CAG GAG AAA AAC AAA ATT AGG CCT CGT GGA CAG AGA 1008 

Val Ser Gly Thr Gin Glu Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

GAT TCA AGC TAT TAT TGG GAA ATA GAA GCC AGT GAA GTG ATG CTG TCC 1056 

Asp Ser Ser Tyr Tyr Trp Glu lie Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

ACT CGG ATT GGG TCA GGC TCT TTT GGA ACT GTT TAT AAG GGT AAA TGG 1104 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

CAC GGA GAT GTT GCA GTA AAG ATC CTA AAG GTT GTC GAC CCA ACC CCA 1152 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

GAG CAA TTC CAG GCC TTC AGG AAT GAG GTG GCT GTT CTG CGC AAA ACA 1200 

Glu Gin Phe Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

CGG CAT GTG AAC ATT CTG CTT TTC ATG GGG TAC ATG ACA AAG GAC AAC 1248 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

CTG GCA ATT GTG ACC CAG TGG TGC GAG GGC AGC AGC CTC TAC AAA CAC 1296 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 hJU 

CTG CAT GTC CAG GAG ACC AAG TTT CAG ATG TTC CAG CTA ATT GAC ATT 1344 
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Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 w 

GCC CGG CAG ACG GCT CAG GGA ATG GAC TAT TTG CAT GCA AAG AAC ATC 1392 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

ATC CAT AGA GAC ATG AAA TCC AAC AAT ATA TTT CTC CAT GAA GGC TTA 1440 
lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 

ACA GTG AAA ATT GGA GAT TTT GGT TTG GCA ACA GTA AAG TCA CGC TGG 1488 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 

AGT GGT TCT CAG CAG GTT GAA CAA CCT ACT GGC TCT GTC CTC TGG ATG 1536 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 

500 505 510 

GCC CCA GAG GTG ATC CGA ATG CAG GAT AAC AAC CCA TTC AGT TTC CAG 1584 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 525 

TCG GAT GTC TAC TCC TAT GGC ATC GTA TTG TAT GAA CTG ATG ACG GGG 1632 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Thr Gly 
530 535 34U 

GAG CTT CCT TAT TCT CAC ATC AAC AAC CGA GAT CAG ATC ATC TTC ATG 1680 

Glu Leu Pro Tyr Ser His He Asn Asn .Arg Asp Gin He He Phe Met 
545 550 555 

GTG GGC CGA GGA TAT GCC TCC CCA GAT CTT AGT AAG CTA TAT AAG AAC 1728 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 
5fi5 570 3/3 

TGC CCC AAA GCA ATG AAG AGG CTG GTA GCT GAC TGT GTG AAG AAA GTA 1776 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 3 * u 

AAG GAA GAG AGG CCT CTT TTT CCC CAG ATC CTG TCT TCC ATT GAG CTG 1824 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
* 5g5 600 603 

CTC CAA CAC TCT CTA CCG AAG ATC AAC CGG AGC GCT TCC GAG CCA TCC 1872 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
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610 615 620 

TTG CAT CGG GCA GCC CAC ACT GAG GAT ATC AAT GCT TGC ACG CTG ACC 1920 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

ACG TCC CCG AGG CTG CCT GTC TTC TAG 1947 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
15 10 15 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Aso Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys 
K 130 135 1*0 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 16P 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 I/a 
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Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 1 

Leu Leu Leu Phe Pro Asn Ser Thr He Gly Asp Ser Gly Val Pro Ala 
195 200 

Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 

210 215 ZZ ° 

Pro Val Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 " 3 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 

245 z oU 
Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 



260 



Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 

275 ZoU 
Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 

290 295 
Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Val Ser Gly Thr Gin Glu Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 -" u 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 303 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

370 375 
Glu Gin Phe Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 HA 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 

440 HH3 



435 



Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 



450 



455 
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Ile His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe G1y Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 " 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Thr Gly 
530 535 540 

Glu Leu Pro Tyr Ser His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 
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WHAT IS rT.RTMED IS : 

1 . A method of identifying an individual at an 
increased risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene of said 
individual; 

analyzing products of said amplification for 
evidence of mutation; and 

classifying an individual having one or more 
mutations in said- region as having an increased risk for 
developing cancer. 

2 The method according to claim 1, wherein said 
region encodes at least amino aoids 514 to 535 of SEQ ID 
NO: 12. 

3 The method according to claim 2, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
H0:12. 

4 The method according to claim 3, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO:12. 

5. The method according to claim 1, wherein said 
products are analyzed by DNA sequencing. 

6. The method according to claim 1, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR). 

7. The method according to claim 6, wherein said 
PCR employs a primer comprising SEQ ID N0:7 and a primer 
comprising SEQ ID NO: 8. 

8. A method for determining a prognosis in 
patients afflicted with cancer, comprising: 
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amplifying a region of the c-raf-1 gene of said 
individual; 

analyzing products of said amplification for 
evidence of mutation; and 

classifying a patient having no mutation in said 
region* as being less likely to suffer disease relapse or 
having an increased chance of survival than a patient 
having one or more mutations in said region. 

9. The method according to claim 8, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
NO: 12. 

10. The method according to claim 9, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO: 12. 

11. The method according to claim 10, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
N0:12. 

12. The method according to claim 9, wherein said 
products are analyzed by DNA sequencing. 

13. The method according to claim 9/ wherein said 
amplification is effected using polymerase chain 
reaction (PCR). 

14. The method according to claim 13, wherein said 
PCR employs a primer comprising SEQ ID NO: 7 and a primer 
comprising SEQ ID NO: 8. 

15. A method for determining the proper course of 
treatment for a patient afflicted with cancer, 
comprising 1 : 

amplifying a region of the c-raf-1 gene of said 
patient; 
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analyzing products of said amplification for 
evidence of mutation; 

identifying a patient having at least one mutation 
in said region, which patient may require treatment 
proper for a patient having a lesser chance of survival 
or decreased time to relapse; and 

identifying a patient lacking mutations in said 
region, which patient may require treatment proper for a 
patient having a greater chance of survival or being 
less likely to suffer disease relapse. 

16. The method according to claim 15, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
NO: 12. 

17. The method according to claim 16, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO: 12. 

18. The method according to claim 17, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
N0:12. 

19. The method according to claim 16, wherein said 
products are analyzed by DNA sequencing. 

20. The method according to claim 16, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR). 

21. The method according to claim 20, wherein said 
PCR employs a primer comprising SEQ ID NO: 7 and a primer 
comprising SEQ ID NO: 8. 
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Figure 3 
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Figure 4 
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Figure 6 



1. Polymerase Chain Reaction (PCR) Amplification of Target DNA 
Either Genomic DNA or cDNA 

Nucleotide 1307 of coding Nucleotide 1326 of coding 

Primer 1 = 5'- AGGAGACCAAGTTTCAGATG -3' 

Nucleotide 1915 of coding Nucleotide 1896 of coding 

I I 

Primer 2 = 5'- GCGTGCAAGCATTGATATCC -3* 



PCR Cycles: 
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